Safe Reinforcement Learning for Transition Control of Ducted-Fan UAVs
نویسندگان
چکیده
Ducted-fan tail-sitter unmanned aerial vehicles (UAVs) provide versatility and unique benefits, attracting significant attention in various applications. This study focuses on developing a safe reinforcement learning method for back-transition control between level flight mode hover ducted-fan UAVs. Our enables transition with minimal altitude change time while adhering to the velocity constraint. We employ Trust Region Policy Optimization, Proximal Optimization Lagrangian, Constrained (CPO) algorithms controller training, showcasing superiority of CPO algorithm necessity The trajectory achieved using closely resembles optimal obtained via well-known GPOPS-II software SNOPT solver. Meanwhile, also exhibits strong robustness under unknown perturbations UAV model parameters wind disturbance.
منابع مشابه
Improving Control System Effectiveness for Ducted Fan Vtol Uavs Operating in Crosswinds
A valuable new resource being developed for today’s soldier are small ducted fan VTOL UAVs. Although beneficial in many ways, a major operational problem of ducted fan vehicles is precise control when flying in crosswinds and turbulent conditions in general. There are two significant, inherent issues associated with ducted fan control in crosswinds; 1) lateral momentum drag and 2) a duct stabil...
متن کاملComparison of Nonlinear Control Designs for a Ducted Fan Model
In this paper we compare diierent nonlinear control design methods by applying them to the planar model of a ducted fan engine. Those design can be divided into two steps. The rst step requires the derivation of a Control Lyapunov Function (CLF), while the second involves using an existing CLF to generate a controller. The main premise of this paper is that by combining the best of these two ph...
متن کاملLyapunov Design for Safe Reinforcement Learning Control
We propose a general approach to safe reinforcement learning control based on Lyapunov design methods. In our approach, a Lyapunov function—a special form of domain knowledge—is used to formulate the action choices available to a reinforcement learning agent. A learning agent choosing among these actions provably enjoys performance guarantees, and satisfies safety constraints of various kinds. ...
متن کاملNoise Source Identification for Ducted Fan Systems
Understanding combustion noise source mechanisms, designing efficient acoustic liners and optimising active control algorithms for noise reduction requires the identification of the frequency and modal content of the combustion noise contribution. Coherence-based noise source identification techniques have been developed which can be used to identify the contribution of combustion noise to near...
متن کاملLinear Parameter-Varying Control of a Ducted Fan Engine
Parameter-dependent control techniques are applied to a thrust vectored ducted fan engine. The synthesis technique is based on the solution of Linear Matrix Inequalities and produces a controller which achieves speci ed performance against the worst-case time variation of measurable parameters entering the plant in a linear fractional manner. Thus the plant can have widely varying dynamics over...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Drones
سال: 2023
ISSN: ['2504-446X']
DOI: https://doi.org/10.3390/drones7050332